What modes of action define participation in Gravity Spy?
What modes of action distinguish promoted contributors and those who remain at the same level?
What is the relationship between activities and performance?
Do volunteers maintain certain routines?
Motifs were built in chunks of five such that we would capture overlapping activities in motifs.
To reduce the number of motifs, we combined the number similar activities into a single motif such that five learning activities in a pattern of interactions would be represented as l5 and 12 learning interactions would be represented as l12.
For motifs, we used descriptive
statistics to understand common and uncommon modes of action (motif),
and whether a motif was rare as measured by term frequency inverse
document frequency (tfidf)
term frequency (tf) - how frequently a word occurs in a document. inverse document frequency (idf) which decreases the weight for commonly used words and increases the weight for words that are not used very much in a collection of documents. tf-idf (the two quantities multiplied together) indicated the frequency of a term adjusted for how rarely it is used.
To follow the analogy - a document is one of the grouping factors (e.g. level, user, promoted/not promted) and a term is a motif. In this analysis, we are only concerned with the series of activities in the set of five and not the order of those activities.
## Joining, by = "level"
datatable(freq_by_rank)## Warning in instance$preRenderHook(instance): It seems your data is too big
## for client-side DataTables. You may consider server-side processing: https://
## rstudio.github.io/DT/server.html
datatable(project_tf_idf)## Warning in instance$preRenderHook(instance): It seems your data is too big
## for client-side DataTables. You may consider server-side processing: https://
## rstudio.github.io/DT/server.html
project_tf_idf.viz